SPECIFICATION 



TO ALL WHOM IT MAY CONCERN: 

BE IT KNOWN THAT I, Hisashige Ando, a citizen of 
Japan residing at Kawasaki, Japan have invented certain new 
and useful improvements in 



INFORMATION-PROCESSING DEVICE THAT EXECUTES 
GENERAL- PURPOSE PROCESSING AND TRANSACTION 

PROCESSING 



of which the following is a specification 
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TITLE OF THE INVENTION 

INFORMATION-PROCESSING DEVICE THAT 
EXECUTES G^ERAL- PURPOSE PROCESSING AND TRANSACTION 
PROCESSING 

BACKGROUND OF THE INVENTION 

1. Field of the Invention 

The present invention generally relates to 
an information-processing device, and more 
particularly relates to an information-processing 
device that executes both general-purpose processing 
and transaction processing of data. 

2. Description of the Related Art 
Current high-performance general -purpose 

processors are usually designed to rapidly execute 
instructions suitable for a wide variety of 
applications such as commercial applications, 
scientific applications and multimedia processing. 
For instance, a current high-performance general- 
purpose processor includes a floating-point unit so 
as to rapidly execute scientific applications that 
require execution of a large amount of floating- 
point arithmetic operations. In general, high- 
performance general-purpose processors are designed 
to execute a single program consisting of a series 
of instructions. In order to execute the series of 
instructions rapidly, each high-performance general- 
purpose processor includes a function to predict a 
branch outcome for executing a branch instruction 
before a branch direction is known, a reservation 
station for efficient out-of-order execution of 
instructions, and a reorder buffer for keeping track 
of results from the instructions executed out of 
order . 

However, recently, a demand for computers 
has been increasing especially in areas of systems 
that execute transaction processing and web server 
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processing. Operations executed in the transaction 
processing and the web server processing are mainly 
logical operations and integer operations so that a 
frequency of executing the floating-point arithmetic 
5 operations is low in the transaction processing and 
the web server processing. In addition, in the 
transaction processing and the web server processing, 
a performance to concurrently execute a large number 
of small-sized processes takes priority over a 
10 performance to execute a single large-sized program 
rapidly . 

As described above, the current high- 
performance general-purpose processors are designed 
>^ to achieve high performance over a wide computing 

15 area, and thus include a large amount of hardware 

3 

^ inefficiently utilized for executing the transaction 

0 processing and the web server processing. 

2 Additionally, a large amount of hardware required 

~ for rapid execution of a single large-sized 

H 20 sequential program is overkill for concurrent 

execution of large quantities of simple processes as 

£ y 

01 required in the transaction processing and the web 

g server processing without carrying overhead hardware 

^ for functions rarely required. Therefore, there has 

25 been a demand for an information-processing device 
that can rapidly execute the transaction processing 
and the web server processing. 

A description will now be given of a 
conventional high-performance general -purpose 

30 processor with reference to FIG. 1. A high- 
performance general-purpose processor 1 shown in FIG. 
1 includes an instruction cache and instruction 
control unit 2, a branch-prediction unit 3, a 
program counter 4, a checkpoint unit 5, a fixed- 

35 point register file 6, a floating-point register 

file 7, a fixed-point reorder buffer 8, a floating- 
point reorder buffer 9, a fixed-point reservation 
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station 10, a floating-point reservation station 11, 
a load/store reservation station 12, a fixed-point 
unit 13, a floating-point unit 14, a load/store unit 
15 and a data cache 16. 
5 The instruction cache and instruction 

control unit 2 stores instructions, as well as 
fetches and distributes instructions to be executed. 
Each unit in the high-performance general-purpose 
processor 1 is controlled by a result of decoding 
10 fetched instructions. The branch-prediction unit 3 
detects a conditional branch instruction from 
decoded instructions, and predicts a direction of 
^ each branch before a branching condition is known. 

^ The program counter 4 points to an address of an 

^ 15 instruction being executed in order. The checkpoint 

J unit 5 stores a processor status including register 

0 values and the like when a branch instruction ' is 
%I executed based on a branch prediction. If the 

J" branch prediction has been proved wrong, the high- 

H 20 performance general-purpose processor 1 can recover 

isi the processor status including the register values 

01 before branching by reading information stored in 

2 the checkpoint unit 5, thereby recovering from a 
^ branch prediction error. 

25 The fixed-point register file 6 stores 

operands for fixed-point operations, and supplies 
the operands necessary to execute the operation to 
the fixed-point reservation station 10. 
Additionally, the fixed-point register file 6 stores 

30 operands for load/store instructions, and supplies 

the operands necessary to execute the instruction to 
the load/store reservation station 12. The fixed- 
point reorder buffer 8 stores results of 
calculations executed by the fixed-point unit 13 and 

35 results of load instructions executed by the 

load/store unit 15. Additionally, the fixed-point 
reorder buffer 8 has capability of rectifying an 
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order of generated results as if they were generated 
in order, and supplies correct operands to 
corresponding reservation stations . 

The floating-point register file 7 stores 
5 operands for floating-point operations, and supplies 
the operands necessary to execute the operation to 
the floating-point reservation station 11. 
Additionally, the floating-point register file 7 
stores operands for load/store instructions, and 
10 supplies the operands necessary to execute the 

instruction to the load/store reservation station 12. 
The floating-point reorder buffer 9 stores results 
^ of calculations executed by the floating-point unit 

^ 14 and results of a load instruction executed by the 

SJ 15 load/store unit 15. Additionally, the floating- 

^ point reorder buffer 9 has capability of rectifying 

p an order of generated results as if they were 

ff^ generated in order, and supplies correct operands to 

corresponding reservation stations. 
M: 20 The fixed-point reservation station 10 

nj stores fixed-point instructions used for executing 

~ fixed-point operations. The fixed-point reservation 

□ station 10 waits for operands necessary to execute 

fixed-point instructions to be supplied from the 
25 fixed-point register file 6, from the fixed-point 
reorder buffer 8, or directly from the fixed-point 
unit 13 or the load/store unit 15. The fixed-point 
instructions stored in the fixed-point reservation 
station 10 become executable after receiving all the 
30 operands necessary to execute the instructions. In 
such a case, the fixed-point reservation station 10 
selects executable instructions, and supplies the 
executable instructions to the fixed-point unit 13. 
The maximum number of fixed-point instructions the 
35 fixed-point reservation station 10 can supply 

simultaneously to the fixed-point unit 13 is equal 
to the number of fixed-point operation modules 
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provided in the fixed-point unit 13. It should be 
noted that the fixed-point unit 13 includes a 
plurality of the fixed-point operation modules, each 
fixed-point operation module executing the 
5 instructions supplied from the fixed-point 
reservation station 10. 

The floating-point reservation station 11 
stores floating-point instructions used for 
executing floating-point operations. The floating- 
10 point reservation station 11 waits for operands 

necessary to execute floating-point instructions to 
be supplied from the floating-point register file 7, 
^ from the floating-point reorder buffer 9, or 

2 directly from the floating-point unit 14 or the 

SI 15 load/store unit 15. The floating-point instructions 

J Stored in the floating-point reservation station 11 

Q become executable after receiving all the operands 

2 necessary to execute the instructions. In such case, 

the floating-point reservation station 11 selects 
20 executable instructions, and supplies the executable 
instructions to the floating-point unit 14. The 
^ maximum number of floating-point instructions the 

O floating-point reservation station 11 can supply 

simultaneously to the floating-point unit 14 is 
25 equal to the number of floating-point operation 

modules provided in the floating-point unit 14 . It 
should be noted that the floating-point unit 14 
includes a plurality of the floating-point operation 
modules, each floating-point operation module 
30 executing the instructions supplied from the 
floating-point reservation station 11. 

The load/store reservation station 12 
stores load/store instructions. The load/store 
reservation station 12 waits for operands necessary 
35 to execute load/store instructions to be supplied 

from the fixed-point register file 6, the floating- 
point register file 7, the fixed-point reorder 
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buffer 8, or the floating-point reorder buffer 9. 
The load/store instructions become executable after 
receiving all the operands necessary to execute the 
instructions. In such case, the load/store 
5 reservation station 12 selects executable 
instructions, and supplies the executable 
instructions to the load/store unit 15. The maximum 
number of load/store instructions the load/store 
reservation station 12 can supply to the load/store 

10 unit 15 is equal to the number of load/store 

operation modules provided in the load/store unit 15 
It should be noted that the load/store unit 15 
includes a plurality of the load/store operation 
modules, each load/store operation module executing 

15 the instructions supplied from the load/store 
reservation station 12. 

The fixed-point unit 13 executes the 
fixed-point instructions supplied from the fixed- 
point reservation station 10, and sends results of 

20 executed fixed-point operations to the fixed-point 
reorder buffer 8, the fixed-point reservation 
station 10 and the load/store reservation station 12 
Similarly, the floating-point unit 14 executes the 
floating-point instructions supplied from the 

25 floating-point reservation station 11, and sends 

results of executed floating-point operations to the 
floating-point reorder buffer 9, the floating-point 
reservation station 11 and the load/store 
reservation station 12. Additionally, the 

30 load/store unit 15 executes the load/store 
instructions supplied from the load/store 
reservation station 12, and sends results of 
executed load/store operations to the fixed-point 
reorder buffer 8 and the fixed-point reservation 

35 station 10, or to the floating-point reorder buffer 
9 and the floating-point reservation station 11, 
The load/store unit 15 writes data in the data cache 



16 when executing a storing instruction. The 
load/store unit 15 reads data from the data cache 16 
when executing a load instruction. 

The high-performance general -purpose 
processor 1 can execute both fixed-point 
arithmetic/logical operations and floating-point 
arithmetic operations by including the fixed-point 
unit 13 and the floating-point unit 14. 
Additionally, the high-performance general-purpose 
processor 1 includes the branch-prediction unit 3, 
the fixed-point reorder buffer 8, the floating-point 
reorder buffer 9, the fixed-point reservation 
station 10, the floating-point reservation station 
11 and the load/store reservation station 12 so that 
the high-performance general-purpose processor 1 can 
predict a branch direction for executing a branch 
instruction before a branch direction is known, and 
can produce results as if instructions are executed 
in order when real execution takes advantage of out- 
of-order execution . 

As described above, the high-performance 
general-purpose processor 1 is designed to rapidly 
execute various instructions over the wide computing 
area, and to execute sequential programs rapidly by 
using branch prediction and out-of-order execution. 
The high-performance general-purpose processor 1 
also has multiple operation modules in the fixed- 
point unit 13, in the floating-point unit 14, and in 
the load/store unit 15. With those operation 
modules, the high-performance general-purpose 
processor 1 can execute six to eight instructions 
simultaneously . 

FIG. 2 is a block diagram showing a multi- 
thread transaction processing system. A multi- 
thread transaction processing system 17 using a 
multi-thread method includes transaction processors 
18-1 through la-n, a memory 19, an input/output 
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interface 20 and a system bus 21. 

Each of the transaction processors 18-1 
through 18 -n includes a plurality of program 
counters for multi- thread processing, thereby 
5 executing transaction processes efficiently. The 
memory 19 is connected through the system bus 21 to 
the transaction processors 18-1 through 18-n, and is 
universally used among the transaction processors 
18-1 through 18-n. The input/output interface 20 
10 provides an interface connecting the system bus 21 
and peripheral devices located outside the multi- 
thread transaction processing system 17. 
^ FIG. 3 is a block diagram showing a 

transaction processor. The transaction processor 
s| 15 18-1 shown in FIG. 3 includes an instruction cache 

£ and instruction control unit 22, program counters 23, 

Q register files 24, a fixed-point unit 25, a 

01 load/store unit 26 and a data cache 27. It should 

J" be noted that the instruction cache and instruction 

' 20 control unit 22, the program counters 23, the 

register files 24, the fixed-point unit 25, the 
load/store unit 26 and the data cache 27 correspond 
□ respectively to the instruction cache and 

^ instruction control unit 2, the program counter 4, 

25 the fixed-point register file 6, the fixed-point 

unit 13, the load/store unit 15 and the data cache 
16 shown in FIG. 1. In addition, each of the 
transaction processors 18-1 through 18-n has a 
structure shown in FIG. 3. In this embodiment, a 
30 floating-point unit is not implemented since cost of 
hardware is too high compared to performance gain in 
transaction processing, which does not require 
floating-point calculations often. Furthermore, the 
transaction processors 18-1 through 18-n are 
35 designed mainly to simultaneously execute a large 
number of programs, for instance, the number of 
processors multiplied by the number of program 
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counters in each processor, rather than to execute a 
single program rapidly, and thus do not include 
functions to predict branch directions, reorder 
buffers and reservation stations • Instead, each of 
5 the transaction processors 18-1 through 18-n 

includes a plurality of program counters 23 and the 
register files 24 for multi- threading . Taking ATM 
transaction processing as an example, A single 
process using a single program counter in a 

10 transaction processor can handle a withdrawal for a 
user A, at the same time, a withdrawal for a user B 
by using another program counter. Consequently, 
each of the transaction processors 18-1 through 18-n 
can execute a plurality of ATM transaction processes 

15 simultaneously . 

Recently, processor speed has been 
increasing substantially, whereas memory speed has 
not increased as the processor speed. Such a 
condition makes a long memory access time and a 

20 resulting processor stall major performance 

impediments. In a case that a waiting period to 
access a memory happens to the transaction processor 
18-1 while executing a series of instructions for a 
first process, the transaction processor 18-1 can 

25 execute another series of instructions for a second 
process by using another program counter shown in 
FIG. 3, thereby improving overall efficiency of 
utilizing hardware resources. As described above, 
the transaction processor 18-1 shown in FIG. 3 can 

30 execute transaction processes efficiently. 

The multi-thread transaction processing 
system 17 shown in FIG. 2 can execute a large number 
of processes simultaneously, and can execute 
transaction processes efficiently by including the 

35 transaction processors 18-1 through 18-n. In 

addition, a size of each transaction processor shown 
in FIG. 3 is smaller than the high-performance 
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general-purpose processor so that multiple 
transaction processors can be fabricated on a single 
chip . 

The high-performance general -purpose 
5 processor 1 shown in FIG. 1 includes many functions 
that are not cost-effective to execute simple 
transaction processes and web server processes as 
the high-performance general-purpose processor 1 is 
designed to execute a wide variety of applications 
10 efficiently. Additionally, the high-performance 
general-purpose processor 1 is less efficient for 
execution of large quantities of simple processes 
_ necessary in the transaction processes and the web 

._fz server processes, since the high-performance 

%l 15 general-purpose processor 1 is designed to execute a 

£ single program at a time. Even if an attempt to 

y i 

Q provide a plurality of high-performance general- 

ff^ purpose processors 1 in a system is made, the number 

of concurrently executed processes is less since the 
H 20 number of the high-performance general-purpose 

rU processors 1 that can be fabricated on a single chip 

is less because of its large circuit size. 
D On the other hand, in a case that a system 

^ includes only the multi-thread transaction 

25 processing system 17 shown in FIG. 2, the system can 
improve efficiency to execute transaction processes 
and/or web server processes, but cannot rapidly 
execute complicated processes such as a scientific 
arithmetic operation that needs floating-point 
30 arithmetic operations or a large process blocking 
execution of other small processes. Further, the 
system including only the multi-thread transaction 
processing system 17 does not have various functions 
to execute a single program rapidly. Accordingly, 
35 in a case that the system is asked to execute a 

single large-sized process or a complicated process, 
efficiency to execute such a process by the system 



-11- 



decreases remarkably . 

SUMMARY OF THE INVENTION 

Accordingly, it is a general object of the 
present invention to provide an information- 
processing device capable of processing a variety of 
instructions efficiently. A more specific object of 
the present invention is to provide an information- 
processing device including processors, wherein the 
processors divide execution of instructions among 
them, thereby efficiently executing the instructions 

The above-described object of the present 
invention is achieved by an information-processing 
device that executes a specific process more 
frequently than other processes among a variety of 
processes, the information-processing device 
including a first processor capable of executing an 
instruction set corresponding to the variety of 
processes, and a second processor capable of 
executing a portion of or the entire instruction set 
the second processor being capable of executing a 
part of the instruction set corresponding to the 
specific process more efficiently than the first 
processor, wherein the second processor executes the 
specific process whereas the first processor 
executes the other processes • 

Accordingly, the information -processing 
device can execute a variety of instructions 
efficiently. 

Other objects, features and advantages of 
the present invention will become more apparent from 
the following detailed description when read in 
conjunction with the accompanying drawings. 

BRIEF. DESCRIPTION OF THE DRAWINGS 

FIG, 1 is a block diagram showing a high- 
performance general -purpose processor; 
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FIG. 2 is a block diagram showing a multi- 
thread transaction processing system; 

FIG. 3 is a block diagram showing a 
transaction processor; 
5 FIG. 4 is a system diagram showing an 

embodiment of the present invention; 

FIG. 5 is a block diagram showing a 
transaction processor in the embodiment of the 
present invention ; 
10 FIG. 6 is a flowchart showing processes to 

detect instructions lacking capabilities in the 
transaction processor in the embodiment of the 
present invention ; 

FIG. 7 is a flowchart showing an 
15 interrupting process in preparation to transfer 
execution of a process to the other type of a 
processor performed by the transaction processor in 
the embodiment of the present invention; 

FIG. 8 is a flowchart showing processes 
20 performed by a general-purpose processor to accept 
transferred execution of the process in the 
embodiment of the present invention; 

FIG. 9 is a block diagram showing an 
instruction-allocating process in the embodiment of 
25 the present invention; and 

FIG. 10 is a flowchart showing processes 
performed by a processor-allocation control. 

DESCRIPTION OF THE PREFERRED EMBODIMENTS 
30 A description will now be given of 

preferred embodiments of the present invention, with 
reference to the accompanying drawings . 

FIG. 4 is a system diagram showing an 
embodiment of the present invention. An 
35 information-processing device 31 according to the 

embodiment includes general-purpose processors 32-1 
and 32-2, transaction processors 33-1 and 33-2, a 



-13- 



main memory 34, an Input/output interface 35, a 
system controller 36 and a system bus 37. 

Each of the general-purpose processors 32- 
1 and 32-2 are processors similar to the high- 
5 performance general-purpose processor 1 shown in FIG. 
1. In other words, each of the general-purpose 
processors 32-1 and 32-2 can execute all types of 
processes rapidly. As each of the general-purpose 
processors 32-1 and 32-2 can predict a branch 
10 direction for executing a branch instruction before 
a branch direction is known, can execute 
instructions out of order. It should be noted that 
^ the number of the general-purpose processors 32-1 

3 and 32-2 included in the information-processing 

'"-J 15 device 31 is not limited to two. Thus, any number 

5 of the general-purpose processors can be included in 

p the information-processing device 31 depending on a 

Oj percentage of load that the other type of a 

2^' processor cannot handle efficiently. 

M= 20 The transaction processors 33-1 and 33-2 

iH can execute a large number of processes 

^ simultaneously. However, a speed of executing each 

Q process is slower than that of the general-purpose 

^ processors 32-1 and 32-2. Additionally, the 

25 transaction processors 33-1 and 33-2 may lack 

capability of executing certain instructions that 
are not important for transaction processing. 
Besides, each of the transaction processors 33-1 and 
33-2 is designed to be a multi-thread system and/or 
30 a system including multiple processors, and thus can 
execute transaction processes efficiently. It 
should be noted that the number of the transaction 
processors 33-1 and 33-2 included in the 
information-processing device 31 is not limited to 
35 two. The information-processing device 31 may 

include any number of the transaction processors as 
the ratio of the load require. 
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The main memory 34 includes a RAM (Random 
Access Memory), and stores data therein. 
Additionally, the main memory 34 is connected 
through the system bus 37 to the general-purpose 
5 processors 32-1 and 32-2, and the transaction 

processors 33-1 and 33-2. Accordingly, the data 
stored in the main memory 34 is shared among the 
processors 32-1, 32-2, 33-1 and 33-2. 

The input/output interface 35 provides an 
10 interface connecting the system bus 37 and 

peripheral devices located outside the information- 
processing device 31. The system controller 36 
_ controls the system bus 37, the main memory 34 and 

2 the input/output interface 35 according to a request 
%| 15 to access the main memory 34 or the input/output 

J interface 35 through the system bus 37 from the 

3 general -purpose processors 32-1 and 32-2, and the 

transaction processors 33-1 and 33-2. 

FIG. 5 is a/o^lock diagram showing a 
^ 20 structure of the transaction processor 33-1, The 

j=f transaction processor 33-1 shown in FIG. 5 includes 

5 a transaction-processing unit 41 and a secondary 

3 cache data memory 42. The transaction-processing 

=^ unit 41 is fabricated on a single semiconductor chip 

25 whereto the secondary cache data memory 42 in this 
embodiment is provided outside the semiconductor 
chip. The secondary cache data memory 42 may be a 
RAM, and temporarily stores data and instructions 
therein . 

30 The transaction-processing unit 41 

includes processor cores 43-1 through 43-m, a 
secondary cache tag 44, an interface 45 and an 
internal bus 46. The processor cores 43-1 through 
43-m are connected through the internal bus 46 to 

35 the secondary cache tag 44 and the interface 45. 
The secondary cache tag 44 manages usage of the 
secondary cache data memory 42. The interface 45 
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provides an interface connecting the secondary cache 
data memory 42 and the internal bus 46. 

The processor cores 43-1 through 43-m may 
have the same structure as the transaction processor 
5 18-1 shown in FIG. 3. In other words, each of the 
processor cores 43-1 through 43-m is multi- threaded 
so as to efficiently execute transaction processes - 
Additionally, each of the processor cores 43-1 
through 43-m does not necessarily need to include 

10 functions to execute complicated arithmetic 
operations such as floating-point arithmetic 
operations. Furthermore, each of the processor 
cores also does not necessarily need to include 
functions to predict branch directions for executing 

15 a branch instruction before the branch direction is 
known, or functions to execute instructions out of 
order. However, each of the processor cores 43-1 
through 43-m is not necessarily multi- threaded since 
the transaction processors 33-1 and 33-2 can execute 

20 concurrent transaction processes by including a 

plurality of the processor cores 43-1 through 43-m. 
Even in an extreme case that each of the transaction 
processors 33-1 and 33-2 includes only the processor 
core 43-1, each of the transaction processors 33-1 

2 5 and 33-2 can execute the concurrent transaction 

processes if the processor core 43-1 is specialized 
in transaction processing with multithreading as 
described with reference to FIG. 3. 

A description will now be given of 

30 processes performed by the transaction processors 
33-1 and 33-2. FIG. 6 is a flowchart showing 
processes performed by the transaction processor 
according to the embodiment of the present invention. 
The transaction processor 33-1 executes steps SlOl 

35 through S105. 

At the step SlOl, the transaction 
processor 33-1 initially issues an instruction. 
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Subsequently , at the step S102, the transaction 
processor 33-1 decides whether the instruction is 
directing execution of a hardware -implemented 
operation, for example, a fixed-point arithmetic 
5 operation. If it is ascertained at the step S102 
that the instruction is directing execution of the 
hardware- implemented operation, the transaction 
processor 33-1 proceeds to the step S103, and 
executes the instruction. Subsequently, at the step 
10 S104, the transaction processor 33-1 decides whether 
there is a next instruction to be executed. If it 
is ascertained at the step S104 that there is the 
next instruction to be executed, the transaction 
y processor 33-1 proceeds to the step SlOl, and issues 

vj 15 the next instruction. If it is ascertained at the 

±; Step S104 that the next instruction does not exist, 

the transaction processor 33-1 terminates the above- 
-31 described processes. 

If it is ascertained at the step S102 that 
20 the instruction is directing execution of an 
ry operation not implemented in hardware, the 

It' transaction processor 33-1 proceeds to the step S105, 

P and generates an interrupt. After generating the 

O interrupt, the transaction processor 33-1 terminates 

25 the above-described processes. 

A description will now be given of an 
interrupt handling process executed by the 
transaction processor 33-1. FIG. 7 is a flowchart 
showing the interrupt handling process executed by 
30 the transaction processor according to the 
embodiment of the present invention. The 
transaction processor 33-1 executes steps S201 
through S206 as the interrupt handling process. 

At the step S201, the transaction 
35 processor 33-1 initially analyzes interrupt 

information that has been generated at the step S105. 
Subsequently, at the step S202, the transaction 
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processor 33-1 decides whether an incoming interrupt 
is a regular interrupt or not. If it is ascertained 
at the step S202 that the incoming interrupt is the 
regular interrupt, the transaction processor 33-1 
5 proceeds to the step S203, and executes the regular 
interrupt handling process. If it is ascertained at 
the step S202 that the interrupt is for executing a 
non hardware -implemented operation, for example, a 
floating-point arithmetic operation, the transaction 

10 processor 33-1 proceeds to the step S204. At the 

step S204, the transaction processor 33-1 stores an 
internal condition of the transaction processor 33-1 
necessary for transferring execution of the on 
hardware -implemented operation, in a main memory. 

15 After storing the internal condition, the 

transaction processor 33-1 proceeds to the step S205, 
then stores an instruction address of the 
instruction that has been issued at the step SlOl 
and a memory address of the internal condition 

20 stored in the memory, in internal registers of the 
transaction processor 33-1, and interrupts the 
general-purpose processor 32-1 or 32-2. To be 
concrete, the transaction processor 33-1 interrupts 
one of the general-purpose processors 32-1 and 32-2 

2 5 by supplying an interrupt vector including contents 
of the above-described internal registers to one of 
the general-purpose processors 32-1 and 32-2. As 
described above, in a case that the transaction 
processor 33-1 or 33-2 detects an instruction to 

30 execute a non hardware- implemented operation, for 

example, a floating-point arithmetic operation, the 
transaction processor 33-1 or 33-2 interrupts one of 
the general-purpose processors 32-1 and 32-2. 

A description will now be given of 

35 processes performed by the general-purpose processor 
32-1. FIG. 8 is a flowchart showing processes 
executed by the general-purpose processor according 
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to the embodiment of the present invention. The 
general-purpose processor 32-1 executes steps S301 
through S304. 

Upon receiving an interrupt request from 
5 other processors such as the transaction processors 
33-1, 33-2, the input/output interface 35 or 
internal units of the general-purpose processor 32-1, 
the general-purpose processor 32-1 starts executing 
processes transferred from the other type of a 

10 processor, at the step S301. 

At the step S301, the general-purpose 
processor 32-1 decides whether the interrupt request 
is a request to the general-purpose processor 32-1 
for executing a non hardware -implemented instruction 

15 transferred from one of the transaction processors 
33-1 and 33-2. If it is ascertained at the step 
S301 that the interrupt request is not the request 
to the general-purpose processor 32-1 for executing 
an instruction transferred from one of the 

20 transaction processors 33-1 and 33-2, the general- 
purpose processor 32-1 executes a regular interrupt 
handling process at the step S302, and then 
terminates the above-described processes. 

If it is ascertained at the step S301 that 

2 5 the interrupt request is the request to the general- 
purpose processor 32-1 for executing an instruction 
transferred from one of the transaction processors 
33-1 and 33-2, the general-purpose processor 32-1 
proceeds to the step S303, and obtains the interrupt 

30 vector supplied with the interrupt request, for 
instance, from the transaction processor 33-1. 
Additionally, the general-purpose processor 32-1 
obtains the instruction address and a memory address 
from the interrupt vector. The general-purpose 

35 processor 32-1 copies the internal condition of the 
transaction processor 33-1 therein by using the 
memory address where the transaction processor 33-1 
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has stored the internal condition. Subsequently, at 
the step S304, the general-purpose processor 32-1 
executes the instruction located at the above 
instruction address. In other words, the general- 
purpose processor 32-1 executes instructions 
starting from the instruction requested by the 
transaction processor 33-1. As described above, 
each of the general-purpose processors 32-1 and 32-2 
executes a non hardware -implemented operation, for 
example, a floating-point arithmetic operation, and 
instructions following the instruction that could 
not be executed by the transaction processor 33-1 or 
33-2. 

According to the embodiment of the present 
invention as described above, in a case that an 
instruction to execute a non hardware -implemented 
operation such as a floating-point arithmetic 
operation is detected in the transaction processor 
33-1 or 33-2, the transaction processor 33-1 or 33-2 
in which the non hardware -implemented operation is 
detected interrupts the general-purpose processors 
32-1 or 32-2 to let the general-purpose processors 
32-1 or 32-2 execute the non hardware -implemented 
operation. In the embodiment, execution of the non 
hardware-implemented operation is transferred from 
the transaction processor 33-1 to the general- 
purpose processor 32-1. However, any instructions 
executed inefficiently by the transaction processor 
but efficiently by the general-purpose processor can 
be transferred from the transaction processor to the 
general-purpose processor in addition to non 
hardware- implemented operations. The present 
invention includes such embodiments. 

In the present invention, a transaction 
process can be executed efficiently by transaction 
processors. However, an OS (Operating System) may 
be executed more efficiently by one of the general- 
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purpose processors 32-1 and 32-2 since general- 
purpose processors can execute a single instruction 
stream faster than the transaction processors. 
However, selection of a processor executing the OS 
5 depends on efficiency of executing the OS, and thus 
the OS is not necessarily executed by one of the 
general-purpose processors 32-1 and 32-2. In other 
words, the OS may be executed by one of the 
transaction processors 33-1 and 33-2. Additionally, 
10 by sharing the main memory 34 among the general- 
purpose processors 32-1 and 32-2, and the 
transaction processors 33-1 and 33-2, the 
^ information-processing device 31 can pass a process 

2 efficiently between the processors. 

15 A description will now be given of a 

£ process-allocating method for allocating processes 

□ to processors. FIG. 9 is a diagram showing a 

^ process-allocating function according to the 

embodiment of the present invention. The OS manages 
H 20 allocation of processes or threads stored in a 

ij^ process queue 71. This process -allocating function 

m ot the OS is achieved by a dispatcher 72. The 

O process queue 71 stores processes to be executed by 

™ processors. The dispatcher 72 monitors a process 

25 indicated by the process queue 71 as well as 

monitors the general-purpose processors 32-1 and 32- 
2, and the transaction processors 33-1 and 33-2. 
The dispatcher 72 then allocates the process 
indicated by the process queue 71 to one of the 
30 processors. Since the system according to the 
present invention mainly executes transaction 
processes, all the processes are preferably 
allocated initially to the transaction processors 
33-1 and 33-2 instead of the general-purpose 
35 processors 32-1 and 32-2. Alternatively, 

transaction processes may be allocated to the 
transaction processors 33-1 and 33-2, whereas 
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processes initially not proved to be the transaction 
processes may initially be allocated to the general- 
purpose processors 32-1 and 32-2. Execution of 
allocated processes will be suspended by a system 
5 call (an interrupt by the OS) requesting 

input/output operations or requesting some system 
resources that are occupied by some other process. 
A system call/trap handler 73 suspends execution of 
processes according to contents of an interrupt when 
10 having received the system call from the process 
being executed or one of the general-purpose 
processors 32-1 and 32-2, and the transaction 
processors 33-1 and 33-2. Interrupt requesting not 

o 

~Z implemented in hardware described in FIG. 7 is one 

%| 15 of the cause of process suspension, and enters into 

j system call/ trap handling. In addition, if 

2 execution of a certain process on one of the 

yl transaction processors 33-1 and 33-2 is determined 

" inefficient such by a performance monitor, the 

Lx 20 process may issue a system call asking the execution 

ry be transferred to one of the general purpose 

1^ processors 32-1 and 32-2. 

Q FIG. 10 is a flowchart showing the 

O process-allocating function according to the 

25 embodiment. The system call/trap handler 73 

executes steps S401 through S405 shown in FIG. 10. 
The system call/trap handler 73 is invoked by 
detection of interrupts on system calls from the 
general-purpose processors 32-1 and 32-2, and the 
30 transaction processors 33-1 and 33-2. The cause of 
an interrupt on a system call is analyzed at the 
step S401, the system call/trap handler 73 decides 
whether the processor can resume executing a series 
of instructions in the process being executed by the 
35 processor from an interrupted instruction, at the 
step S402. If it has been determined at the step 
S402 that the series of instructions in the process 
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can be resumed by the processor from the interrupted 
instruction, the system call/trap handler 73 marks 
the process for executing on the same processor or 
the same type of the processor at the step S403, 
5 Subsequently, at the step S404, after the cause of 
suspension is cleared, the system call/trap handler 
73 stores the process to which the information has 
been added in the process queue 71. The dispatcher 
72 selects the process in the process queue 71 as it 

10 becomes the first priority for execution and tries 
to allocate to the same processor as it has been 
executing* However, in a case that the processor 
that has been executing the process is now being 
used for executing another process, the process is 

15 allocated to another thread of the processor, or to 
another processor if they are available. 

If it is determined at the step S402 that 
the process cannot or inefficient to be resumed by 
the same processor, the system call/trap handler 73 

20 makes the process that limits a type of a processor 
executing the process to a general-purpose processor 
at the step S405, Subsequently, the system 
call/trap handler 73 executes the step S404. 

The dispatcher 72 allocates a process to a 

25 specified processor if entry in the process queue 71 
includes the information that limits a type of a 
processor executing the process. For instance, in a 
case that a process including a floating-point 
arithmetic operation has been initially allocated to 

30 a transaction processor, an interrupt occurs in the 
process. Subsequently, the process is stored with 
information specifying a general-purpose processor 
in the process queue 71. Then the dispatcher 72 
allocates the process to one of the general-purpose 

35 processors 32-1 or 32-2. 

As described above, the system according 
to the present invention can efficiently execute 
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processes as a whole by allocating all the processes 
to one of the transaction processors 32-1 and 32-2 
since main processes executed by the system are 
transaction processes. Alternatively, the system 
5 may allocate transaction processes to a transaction 
processor, and processes not proved to be the 
transaction processes to a general-purpose processor 
initially. In addition, allocation of processes is 
desirably optimized according to performances of the 

10 general-purpose processors 32-1 and 32-2, and the 
transaction processors 33-1 and 33-2. In the 
embodiment, instruction sets may be the same for the 
general-purpose processors 32-1 and 32-2, and the 
transaction processors 33-1 and 33-2. 

15 In the above-described embodiment, a 

description has been given of the system wherein the 
general-purpose processors 32-1 and 32-2, and the 
transaction processors 33-1 and 33-2 share the 
system bus 37 and the main memory 34. 

20 In the above-described invention, there 

are two types of processors provided in an 
information-processing device. One of the 
processors is a general-purpose processor mainly 
designed to demonstrate high performance over a wide 

25 computing area, and to rapidly execute a single 

program. The other is a multi-thread and/or multi- 
processing transaction processor mainly designed to 
execute a large number of small-sized processes 
concurrently. A transaction process is allocated to 

30 the transaction processor, A process that cannot be 
executed efficiently by the transaction processor is 
allocated to the general-purpose processor. 
Accordingly, the present invention provides an 
information-processing device that can efficiently 

35 execute a variety of processes. 

Additionally, for instance, in a case that 
an instruction to execute a floating-point 
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arithmetic operation not Implemented in hardware is 
detected in a process being executed by the 
transaction processor, the transaction processor 
interrupts the general-purpose processor. 
5 Subsequently, the general-purpose processor executes 
remaining instructions in the process starting from 
the instruction to execute the floating-point 
arithmetic operation. Instructions passed from the 
transaction processor to the general-purpose 
10 processor are the instructions that cannot be 

executed efficiently by the transaction processor 
but can be executed efficiently by the general- 
^ purpose processor. Such instructions are most 

2 likely the instructions for executing floating-point 

Sj 15 arithmetic operation, but are not limited to the 

pa 

5 floating-point instructions. 

□ A process may be judged inefficient to be 
J executed by one type of a processor, by monitoring a 

performance counter or by some other means. In such 
=^ 20 a case, a transfer of execution of the process may 

be requested through a system call to optimize 
overall performance of the entire system. 

□ The above description is provided in order 
to enable any person skilled in the art to make and 

25 use the invention and sets forth the best mode 

contemplated by the inventors of carrying out the 
invention. 

The present invention is not limited to 
the specially disclosed embodiments and variations, 
30 and modifications may be made without departing from 
the scope and spirit of the invention. 

The present application is based on 
Japanese Priority Application No, 2000-348475, filed 
on November 15, 2000, the entire contents of which 
35 are hereby incorporated by reference. 



